A factorial sparse coder model for single channel source separation
نویسندگان
چکیده
We propose a probabilistic factorial sparse coder model for single channel source separation in the magnitude spectrogram domain. The mixture spectrogram is assumed to be the sum of the sources, which are assumed to be generated frame-wise as the output of sparse coders plus noise. For dictionary training we use an algorithm which can be described as non-negative matrix factorization with l sparseness constraints. In order to infer likely source spectrogram candidates, we approximate the intractable exact inference by maximizing the posterior over a plausible subset of solutions. We compare our system to the factorial-max vector quantization model, where the proposed method shows a superior performance in terms of signal-tointerference ratio. Finally, the low computational requirements of the algorithm allows close to real time applications.
منابع مشابه
Shift-invariant Sparse Coding for Single Channel Blind Source Separation
In this paper we present results on single channel blind source separation based on a shift-invariant sparse coding model [1], [2] and [3]. This model learns a set of time-domain features from a single observation of the mixed signals. The found features can often be associated with a single source and can therefore be used to reconstruct the individual source signals. This is shown in this pap...
متن کاملSingle Channel Source Separation Using Filterbank and 2D Sparse Matrix Factorization
We present a novel approach to solve the problem of single channel source separation (SCSS) based on filterbank technique and sparse non-negative matrix two dimensional deconvolution (SNMF2D). The proposed approach does not require training information of the sources and therefore, it is highly suited for practicality of SCSS. The major problem of most existing SCSS algorithms lies in their ina...
متن کاملVocal-tract Modeling for Speaker Independent Single Channel Source Separation
In this paper, we investigate two statistical models for the source-filter based single channel speech separation task. We incorporate source-driven aspects by pitch estimation in the model-driven method which models the vocal-tract part as a priori knowledge. This approach results in a speaker independent (SI) source separation method. For modeling the vocal tract filters Gaussian mixture mode...
متن کاملSource-Filter-Based Single-Channel Speech Separation Using Pitch Information
In this paper, we investigate the source–filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the fi...
متن کاملBayesian group sparse learning for music source separation
Nonnegative matrix factorization (NMF) is developed for parts-based representation of nonnegative signals with the sparseness constraint. The signals are adequately represented by a set of basis vectors and the corresponding weight parameters. NMF has been successfully applied for blind source separation and many other signal processing systems. Typically, controlling the degree of sparseness a...
متن کامل